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Abstract: We construct a universal code for stationary and memoryless classical- 
quantum channel as a quantum version of the universal coding by Csiszar and 
Korner. Our code is constructed by the combination of irreducible representa- 
tion, the decoder introduced through quantum information spectrum, and the 
packing lemma. 



1. Introduction 

The channel coding theorem for a stationary and memoryles^ (classical-)quantum 
channel has been established by combining the direct part shown by Holevo [T] 
and Schumacher- Westmoreland [2| with the (weak) converse (impossible) part 
which goes back to 1970's works by Holevo ^JJ. Its strong converse part has 
been shown by Ogawa and Nagaoka^ and Winter [6]. This theorem is a fun- 
damental element of quantum information theory [7]. After their achievement, 
Ogawa and Nagaoka [8] and Hayashi and Nagaoka[9] constructed other codes 
attaining the capacity. However, since the existing codes depend on the form of 
the channel, they are not robust against the disagreement between the sender's 
frame and receiver's frame. In the classical system, Csiszar and Korner [TD] con- 
structed a universal channel coding, whose construction does not depend on the 
channel and depends only on the mutual information and the 'type' of the input 
system, i.e., the empirical distribution of code words, whose precise explanation 
will be explained in Section [3l Such a universal code for the quantum case was 
also constructed for variable- length source coding [TTIIT^ and fixed- length source 
CO ding [To]. 

Concerning the quantum system, Jozsa et al. [13] constructed a universal 
fixed-length source coding, which depends only on the compression rate and 



Throughout the paper, a stationary memoryless channel without using entangled input 
states is simply referred to as a stationary memoryless channel. 
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attains the minimum compression rate. Hayashi jl4j discussed the exponential 
decreasing rate of its decoding error. Further, Hayashi and Matsumoto [15] con- 
structed a universal variable-length source coding in the quantum system. How- 
ever, any universal coding for classical-quantum channel was not constructed. In 
fact, the universal coding is required when the receiver cannot synchronize his 
frame with the sender's frame. 

In the present paper, we construct a universal coding for a classical-quantum 
channel, which attains the quantum mutual information and depends only on 
the coding rate and the 'type' of the input system. In the proposed construction, 
the following three methods play essential roles. One is the decoder given by the 
proof of the information spectrum method. In the information spectrum method, 
the decoder is constructed by the square root measurement of the projectors 
given by the quantum analogue of the likelihood ratio between the signal state 
and the mixture state 9,16 . 

The second method is the irreducible decomposition of the dual representa- 
tion of the special unitary group and the permutation group. The method of ir- 
reducible decomposition provides the universal protocols in quantum setting [131 
[T7 l fT9 l [2T | ll5 p i8 [ [20]. However, even in the classical case, the universal channel 
coding requires the conditional type as well as the type[in|. In the present pa- 
per, we introduce a quantum analogue of the conditional type, which is the most 
essential part of the present paper. 

The third method is the packing lemma, which yields a suitable combina- 
tion of the signal states independent of the form of the channel in the classical 
case[10]. This method plays the same role in the present paper. 

The remainder of the present paper is organized as follows. In section [21 
we give the notation herein and the main result including the existence of a 
universal coding for classical-quantum channel. In this section, we presented the 
exponential decreasing rate of the error probability of the presented universal 
code. In section [31 the notation for group representation theory is presented and 
a quantum analogue of conditional type is introduced. In section [4l we give a 
code that well works universally. In section [5l the exponential decreasing rate 
mentioned in section [2] is proven by using the property given in section [3l 



2. Main Result 

In the classical-quantum channel, we focus on the set of input alphabets X :— 
{1, . . . ,k} and the representation space Ti of the output system, whose dimension 
is d. Then, a classical-quantum channel is given as the map from X to the set of 
densities on TC with the form i i-^ W{i). The n-th discrete memoryless extension 
is given as the map from <Y" to the set of densities on the n-th tensor product 
system 7i®". That is, this extension maps the input sequence i = 
to the state Wn{in) '■= W{ii) ^ ■ ■ ■ W(i„). Sending the message {1, . . . , M„} 
requires an encoder and a decoder. The encoder is given as a map (/?„ from the 
set of messages {1, . . . , M„} to the set of alphabets X", and the decoder is given 
by a POVM F" = {Yl'jfi^. Thus, the triplet <Pn := (M„,(^„,r) is called a 
code. Its performance is evaluated by the size \<Pn\ :~ Mn and the average error 
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probability given by 

e[^n,W] _^TrV7„(^„(z))(/-y,"). 

i—l 

As mentioned in the following main theorem, there exists an asymptotically 
optimal code that depends only on the coding rate. 

Theorem 1 For any distribution p = {pi}i^i on the set of input alphabets 
X :— {1, . . .,k} and any real number R, there is a sequence of codes {$n}^=i 
such that 

lim — logekP„,Ty > max — 

ri^oo n *^ ^ J - o<t<i l+t 

lim - log |<Z>„| = R 
for any classical- quantum channel W , where 4'w,p{t) is given by 

k 

<l>wAt) :--(l-t)logTr(^KW^Wi-*)T^. 

i=l 

Note that the code {^n}'^=i does not depend on the channel W , and depends 
only on the distribution p and the coding rate R. 

The derivative of (/)w,p{t) is given as 

k 

<l>w,piO) = Hp, W) Tr W^(z)(log - log Wp) 

i=l 

k 

4=1 

When the transmission rate R is smaller than the mutual information I{p, W), 

0w.p(i) -tR 

max — > 

o<t<i l + t 

because there exists a parameter t e (0, 1) such that 0w,p(^) — tR> 0. That is, 
the average error probability W] goes to zero. 



3. Group representation theory 

In this section, we focus on the dual representation on the n-fold tensor product 
space by the the special unitary group SU{d) and the n-th symmetric group 
5,J3. For this purpose, we focus on the Young diagram and the 'type'. The 
former is a key concept in group representation theory and the latter is that in 
information theory [I^. When the vector of integers n = (ni, n2, . . . , n^) satisfies 



^ Christandl l22l contains a good survey of representation theory for quantum information. 
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the condition ni > n2 > . . . > Ud > and ~ the vector n is called the 

Young diagram (frame) with size n and depth d, the set of which is denoted as 
Y^. When the vector of integers n satisfies the condition > and J2i=i = 
the vector p = ^ is called the 'type' with size n, the set of which is denoted as 
T^. Further, for p e T,f , the subset of A"" is defined as: 

Tp := {x G A:'"|Thc empirical distribution of x is equal to p}. 

The numbers of these sets are evaluated as follows: 

\Y:!\<\T:^\<{n+ir-' a) 

< |Tp|, (2) 

where H{p) :— — X^iLi Pi logpi lO . Using the Young diagram, the irreducible 
decomposition of the above representation can be characterized as follows: 

neY^ 

where Un is the irreducible representation space of SU{d) characterized by n, 
and Vn is the irreducible representation space of n-th symmetric group Sn char- 
acterized by n. Here, the representation of the n-th symmetric group Sn is 
denoted as V : s G Sn Vg- For n gY^, the dimension of Un is evaluated by 

d(d-l) 

Then, denoting the projection to the subspace tin ® Vn as /„, we define the 
following. 

Pr^ ■= ^ = ...7/^., (4) 



dimW„ iS> V, 



Any state p and any Young diagram n G satisfy the following: 

Thus, H]), (O, and © yield the inequality 

n^\Y:!\pu,n>p'^''. (6) 

Next, we focus on two systems X and y ^ {1, . . . , ^}. When the distribution 
of X is given by a probability distribution p — (pi, . . . ,pd) on {1, . . . , d}, and 
the conditional distribution on y with the condition on X is given by V , we 
denote the joint distribution on A" x 3^ by pV and the distribution on y by 
p ■ V. When the empirical distribution of a; G A"" is . . . , the sequence 
of types V ~ {vi, . . . , Vd) £ T^i^ x • • • x T^^ is called a conditional type for a:jlO|. 
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We denote the set of conditional types for x by V{x^y). For any conditional 
type V for x, we define the subset of 3^": 

The empirical distribution of 1 
{{xi,yi), . .. , {x„,y„)) is equal to pV . J ' 

where p is the empirical distribution of x. 

We define the state for x e A"". For this purpose, we consider a spe- 
cial element x' — (1, . . . , 1, 2, . . . , 2, . . . , A:, . . . , fc). The state px' is defined as 

Px' '■= Pu,mi ® Pu,m2 ® ■ ■ ■ ® Pu,mk - For a general element x e X", we choose a 
permutation s G Sn such that x — sx' . Then, we define the state p^ is defined 
as Px ■= UsPx'Uj, where Us is the unitary representation of Sn. This state plays 
a similar role as the conditional type in the classical case. Using the inequality 
dB]), we have 

kd(d~l) , , 

n^^\Y,f\''Px>Wnix). (7) 

For ni e Y,f-^^ , n2 £ Y^^ , . • • , G , the density ® ® ' ■ ' ® Pn^ 
is commutative with the projector J„ for n £ F^. This fact implies that the 
density px is commutative with the density pu,n- This property is essential for 
the construction of the proposed decoder. 

4. Construction of code 

According to Csiszar and KornerpO^, the proposed code is constructed as follows. 



Tv{x) := y e 3^" 



Lemma 1 For a positive number 6 > 0, a type p £ T^, and a real posi- 
tive number R < H{p), there exist Mn :— e"*^^""^^ distinct elements M.n ■= 
{xi, . . . ,Xm„} C Tp such that their empirical distributions are p and 

\Tv{x) n {Mn \ {x})\ < |Tv(aj)|e-"(^(P)-«) 
for X e Mn C Tp and V £ V{x, X). 

This lemma can be shown by substituting the identical map into V in Lemma 
5.1 in Csiszar and K6rner[10 . Since Csiszar and Korner proved Lemma 5.1 
using the random coding method, we can replace 5 by That is, there exist 

Mn :— e"^^^ distinct elements Mn ■— {xi, . . . ,a;A/„} C Tp such that their 
empirical distributions are p and 

\Tvix) n {Mn \ {x})\ < |Tv(a;)|e-"(^(P)'^) (8) 

for X G Mn C Tp and V G V{x,X). Now, we transform the property ([5]) to a 
more useful form. 

Using the encoder Mn, we can define the distribution Pm„ ^ 

X G Mn 
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For any x e A"", we define the invariant subg roup Sx ^ Sji 

:= {s G Sn\s{x) = x}. 

Since x' e Tp implies that 

p"{x') ^ e-''"^P\ 
any element x' G Tv{x) n A^„ C Tp satisfies 



\Tv{x) n Al„| 1 |Tv(a;) n {Mn \ {x})\ 



\Tv{x)\ \Mn\ 



\Tv{x)\\Mn\ 



<e-«ff(p)gv^^p»(3.')gv^ 



(9) 

when the conditional type V is not identical. Relation © holds for any x'{j^ 
x) G Mn because there exists a conditional type V such that x' G Tv{x) and 
V is not identical. 

Next, for any x G A"" and any real number C„, we define the projection 

P{x) := {/9a, - C„/9c/,ri > 0}, 

where {X > 0} presents the projection ^^^.^ .^q Ei for a Hermitian matrix X with 
the diagonalization X = XiEi. Remember that the density px is commutative 
with the other density pu,n- Using the projection P{x), we define the decoder: 



r.,:= P{x) Fix') 



xeMr, 



In the following, the above-constructed code (e"^ ^ , Mn, {Yx}xeM„ ) is de- 
noted by ^u,niP,R)- 



5. Exponential evaluation 

Hayashi and Nagaoka[5] showed that 

I-Yx> <2{I-P{x'))+4: Y ^(^)- 

Then, the average error probability of ^Pi/.n {p, R) is evaluated by 
Y TrWnix')iI-Yx,) 

E ^^Wn{x'){I~P{x')) + -^ J2 TrW^„K) 



x'eMn 



Y TrWn{x)iI - P{x)) 



x£M„ 



4Tr 



1-^ '"I x'(^x)^M^ 



xeM 



(10) 
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Since the density is commutative with the density pu,m we have 

(/ - P{X)) = {p^ - Cnpu,n < 0} < (11) 

for < t < 1. Since the density px is commutative with the density Wn{x), 
Wn{x)p'^^ is a Hermite matrix and ([7]) imphes that 



Using nil) and we have 

TTWr.{x)(I - P{X)) < TvWn{x)p,'p\j^^Ci 



(12) 



(13) 



Since the quantity TiWn{x){I — P{x)) is invariant for the action of the permu- 
tation and the relation ^ imphes that 



P 



."(.) = e-"-(-)>i!i±ip 



(14) 



for X G Tp, we obtain 



TrWn{x){I-Pix)) = -^ J2 TTWr.{x')iI - P{x')) 



<{n+l)^ ^ P^{x')TtW„{x'){I-P{x')) 

as'EX" 

<{n+lYn''^\Y:irCl,Tr[ P"(^')W^n(^')'"*)P*c/,n 



(15) 
(16) 



\d+ 



2 




ktd(d 


-1) 






ktd(d 


-1) 


2 




ktd(d 


-1) 



^d\ktr<t 



Tr 



V 



.x£X 



(17) 



1 \ n(l-t) 



(n+ir+'^\YTCi Tr ^ p(x)W-„(x)i- 



\xex 



(18) 



where (ITSI) . and p7p foUow from and Lemma [Din Appendix, 

respectively. 
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Next, we evaluate the second term of ()10|) using the invariant property of 



Tr 



= Tr 



= Tr 



= Tr 



<Tr 



P{x) E PmSx')W^{x') 



<e^Tr 
<e^Tr 
<e^Tr 



x'{^x)£Mn 

Tr [F(a;)W®"] 

d(d-l) , 

P{x)n^\Y:i\pu, 



P{x)n^\Y;t\C-'p.. 



n^\Y;^\C-'p.. 



r- d(d-l) , . 



(19) 

(20) 
(21) 
(22) 
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and the inequality P{x){pu^n — 



where (HH), ([201), and ^ follow from 

c-V^) < 0. 

For any t e (0, 1) and i? > 0, we choose \Mn\ := e"-^-^, C„ := e"(^+''(*)), 
and r(i) Since r{t) = <j)w,p{t) - + r{t)), from ^ and 

(P^ . the exponential decreasing rate of the average error probability is evaluated 
as 



lim — log e{<PuAP^ > mini^wjt) - t{R + r{t)),r{t)} 



4>w,p{t) - tR 



That is, when we choose := argmax^g^Q^^ 0H-,p(t) tfl ^ |7V1„| := e"^ and 
C„ := e"(^+''(*°», we obtain 

lim — log£(^>^.„(p,i?),M/)> max 



for any channel W . Therefore, we obtain Theorem [T] 



6. Discussion 



We have constructed a universal code attaining the quantum mutual information 
based on the combination of information spectrum method, group representation 
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theory, and the packing lemma. The presented code weU works because any 
tensor product state p®" is close to the state pu,n- Indeed, Krattenthaler and 
Slater |23j demonstrated the existence of the state an such that ^D{p'^^\\an) 
n for any state p in the qubit system as a quantum analogue of Clarke and 
Barron's result [53]. Its d-dimensional extension is discussed in another paper [53]. 

Further, Hayashi |26| derived an exponential decreasing rate of error probabil- 
ity in classical-quantum channel, which is maxf:o<i<i ~(logX]i Pi Tr[T/F(i)^~*VFp])- 
tR. Since 

^ ^ e*(«+'^W)maxTr(y p,T4^(*)i-*)a* 

i 

i i 

we obtain 

max ~{logyp,Tr[W{i)^~*W!,])~tR> max ^w,p{t)-tR _ 

t:0<t<l ^ L P'^ - t:0<t<l 1+t 

i 

That is, the obtained exponential decreasing rate is smaller than that of Hayashi 
However, according to Csiszar and Korner [lOj . the exponential decreasing rate 
of the universal coding is the same as the optimal exponential decreasing rate 
in the classical case when the rate is close to the capacity. Hence, if a more 
sophisticated evaluation is applied, a better exponential decreasing rate can be 
expected. Such an evaluation is left as a future problem. 
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A. Maximization 

The following lemma is used for the derivation in Section (5] 
Lemma 2 When X is a positive semi-definite, we have 

maxTrAcj* = (TrAT^)^"* (23) 

(T 

for < t < 1, where a is a density matrix. 
Proof: First, we prove 



(24) 
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by the Lagrange multiplier method. Let A be the Lagrange multiplier. Then, 

0^J2({i\X\t)tql-' + X)Sq, 

i 

Thus, 

= {l\X\i)tql-^ + X. 

That is, 

Then, when the maximizing qi has the form C{i\X\i)'^ with the normaliz- 
ing constant C, the constant C has the form C = j—. Substituting 



Since 



T— into q.i, we obtain 



J2{m)r^] = TtX (J2{^\^X\^)rh 



the maximum maX(j TrXa* is given when we choose the basis {\i)} maximizing 

L^X|i)~. Since the function a; i— » x~ is a convex function, {i\7^^X\i)~ 
(i\{^X)^\i). Therefore, 



/ \ 1-* 



^(z|X|z)T^ <(TrXT^) 



1-t 



The equality holds when we choose the basis as the eigenvectors of X. 

Therefore, we obtain (l23t. ■ 
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